Verb-Particle Constructions in the World Wide Web

Author

  • Aline Villavicencio
Abstract

In this paper we investigate the phenomenon of verb-particle constructions, discussing their characteristics and their availability for use with NLP systems. Combinations automatically extracted from corpora greatly improve the coverage of available resources. However, the data sparseness problem is particularly acute for these constructions: even in a corpus as large as the British National Corpus, a great proportion of combinations have a very low frequency, while others never occur at all. We propose using the World Wide Web to validate candidate combinations, minimising the problem of data sparseness. This method can be used to extend the coverage of existing lexical resources by validating combinations automatically generated from classes of verbs, and to improve the reliability of combinations automatically extracted from corpora.
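
The validation idea in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the function names, the `min_hits` threshold, and the toy counts are all assumptions made for illustration, and the web lookup is replaced by a local stand-in so the example runs without network access.

```python
from itertools import product
from typing import Callable, Iterable, List


def generate_candidates(verbs: Iterable[str], particles: Iterable[str]) -> List[str]:
    """Pair every verb of a class with every particle to form candidate combinations."""
    return [f"{verb} {particle}" for verb, particle in product(verbs, particles)]


def validate(candidates: Iterable[str],
             hit_count: Callable[[str], int],
             min_hits: int = 1) -> List[str]:
    """Keep the candidates whose frequency estimate reaches min_hits.

    hit_count is any function mapping a phrase to a count; in a web-based
    setting it would wrap a search-engine query (hypothetical here).
    """
    return [c for c in candidates if hit_count(c) >= min_hits]


# Toy stand-in for web frequencies. These numbers are invented for the
# example and are NOT measurements from the paper or from any search engine.
TOY_COUNTS = {"walk up": 120, "walk down": 95, "run up": 80}


def toy_hit_count(phrase: str) -> int:
    """Return the toy count for a phrase, or 0 if it was never seen."""
    return TOY_COUNTS.get(phrase, 0)


candidates = generate_candidates(["walk", "run"], ["up", "down"])
validated = validate(candidates, toy_hit_count, min_hits=50)
print(validated)
```

Passing the count function in as a parameter keeps the filtering logic independent of the frequency source, so the same code could be run against corpus counts or web counts.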

Similar resources

Verb-particle constructions in a computational grammar of English

In this paper we investigate the phenomenon of verb-particle constructions, discussing their characteristics and the challenges that they present for a computational grammar. We concentrate our discussion on the treatment adopted in a wide-coverage HPSG grammar: the LinGO ERG. Given the constantly growing number of verb-particle combinations, possible ways of extending this treatment are invest...

Guidelines for Propbank framers

Frame files provide guidelines for Propbank annotators and include a list of framesets, or coarse-grained senses of the verbs. A frameset stands for a set of syntactic frames. Following Levin 1993, we assume that the set of syntactic constructions or frames that a verb can occur in is a direct reflection of the underlying semantic components that restrict allowable arguments. A frameset thus co...

Identifying Verbal Collocations in Wikipedia Articles

In this paper, we focus on various methods for detecting verbal collocations, i.e. verb-particle constructions and light verb constructions in Wikipedia articles. Our results suggest that for verb-particle constructions, POS-tagging and restriction on the particle seem to yield the best result whereas the combination of POS-tagging, syntactic information and restrictions on the nominal and verb...

Verb-Particle Constructions And Lexical Resources

In this paper we investigate the phenomenon of verb-particle constructions, discussing their characteristics and their availability for use with NLP systems. We concentrate in particular on the coverage provided by some electronic resources. Given the constantly growing number of verb-particle combinations, possible ways of extending the coverage of the available resources are investigated, tak...

Statistical Techniques for Automatically Inferring the Semantics of Verb-Particle Constructions

This paper describes an investigation of some potential features for a statistical approach to inferring the semantics of verb-particle constructions from corpus data. Verb-particles cause particular problems for the computational semantic analysis of language, because their meaning often cannot be derived through the usual compositional methods of analysis. Two novel techniques are presented w...

Publication date: 2003